Multivariate scan statistics for disease surveillance.
Abstract
In disease surveillance, there are often many different data sets or data groupings for which we wish to do surveillance. If each data set is analysed separately rather than combined, the statistical power to detect an outbreak that is present in all data sets may suffer due to low numbers in each. On the other hand, if the data sets are added by taking the sum of the counts, then a signal that is primarily present in one data set may be hidden due to random noise in the other data sets. In this paper, we present an extension of the spatial and space-time scan statistic that simultaneously incorporates multiple data sets into a single likelihood function, so that a signal is generated whether it occurs in only one or in multiple data sets. This is done by defining the combined log likelihood as the sum of the individual log likelihoods for those data sets for which the observed case count is more than the expected. We also present another extension, where the concept of combining likelihoods from different data sets is used to adjust for covariates. Using data from the National Bioterrorism Syndromic Surveillance Demonstration Project, we illustrate the new method using physician telephone calls, regular physician visits and urgent care visits by Harvard Pilgrim Health Care members cared for by Harvard Vanguard Medical Associates, a large multi-specialty group practice in Massachusetts. For upper and lower gastrointestinal (GI) illness, there were on average 20 telephone calls, nine urgent care visits and 22 regular physician visits per day. The strongest signal was generated by a single data set and due to a familial outbreak of pinworm disease. The second and third strongest signals were generated by the combined strength of two of the three data sets.
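The combination rule described in the abstract can be sketched in a few lines. This is a minimal illustration, not the authors' implementation. It assumes the standard Poisson log likelihood ratio of the spatial scan statistic for each data set, which the abstract does not specify. The function names `poisson_llr` and `combined_llr` and the example counts are hypothetical.

```python
import math


def poisson_llr(c, e, total):
    """Log likelihood ratio for one data set and one candidate cluster.

    c     -- observed cases inside the cluster
    e     -- expected cases inside the cluster under the null
    total -- total cases in the data set (assumed: expected counts are
             scaled to sum to this total, and e < total)

    Assumption: Poisson model. Returns 0 unless the observed count
    exceeds the expected count.
    """
    if c <= e:
        return 0.0
    inside = c * math.log(c / e)
    rest = total - c
    outside = rest * math.log(rest / (total - e)) if rest > 0 else 0.0
    return inside + outside


def combined_llr(datasets):
    """Sum the per-data-set log likelihood ratios for one cluster.

    datasets -- iterable of (c, e, total) tuples, one per data set.
    Data sets with no excess contribute 0, so only those with more
    observed than expected cases add to the combined value.
    """
    return sum(poisson_llr(c, e, total) for c, e, total in datasets)


# Hypothetical counts for one candidate cluster in three data sets.
example = [
    (30, 20.0, 1000),  # excess: contributes to the sum
    (5, 9.0, 500),     # deficit: contributes 0
    (22, 22.0, 1100),  # no excess: contributes 0
]
print(combined_llr(example))
```

Because data sets without an excess add nothing, a cluster that is strong in a single data set keeps its full score, while a cluster with moderate excess in several data sets accumulates evidence across them. In a full scan, the maximum of this combined value over all candidate clusters would then be compared with its distribution under the null hypothesis.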
Related articles
Fast subset scan for multivariate event detection.
We present new subset scan methods for multivariate event detection in massive space-time datasets. We extend the recently proposed 'fast subset scan' framework from univariate to multivariate data, enabling computationally efficient detection of irregular space-time clusters even when the numbers of spatial locations and data streams are large. For two variants of the multivariate subset scan,...
On the limiting distribution of the spatial scan statistic
Bootstrap is the standard method in the spatial scan test. However, because the spatial scan statistic lacks theoretical properties, its development and connection to mainstream statistics has been limited. Using the methods of empirical processes with a few weak regularity conditions, the limiting distribution of the spatial scan statistic, which can provide a theoretical basis for the spatial...
A Nonparametric Scan Statistic for Multivariate Disease Surveillance
OBJECTIVE We present a new method for multivariate outbreak detection, the “nonparametric scan statistic” (NPSS). NPSS enables fast and accurate detection of emerging space-time clusters using multiple disparate data streams, including nontraditional data sources where standard parametric model assumptions are incorrect. BACKGROUND Expectation-based scan statistics [1] extend the traditional sp...
Comparison of covariate adjustment methods using space-time scan statistics for food animal syndromic surveillance
BACKGROUND Abattoir condemnation data show promise as a rich source of data for syndromic surveillance of both animal and zoonotic diseases. However, inherent characteristics of abattoir condemnation data can bias results from space-time cluster detection methods for disease surveillance, and may need to be accounted for using various adjustment methods. The objective of this study was to compa...
A flexibly shaped space-time scan statistic for disease outbreak detection and monitoring
BACKGROUND Early detection of disease outbreaks enables public health officials to implement disease control and prevention measures at the earliest possible time. A time periodic geographical disease surveillance system based on a cylindrical space-time scan statistic has been used extensively for disease surveillance along with the SaTScan software. In the purely spatial setting, many differe...
Spatial scan statistics in loglinear models
The likelihood ratio spatial scan statistic has been widely used in spatial disease surveillance and spatial cluster detection applications. In order to better understand cluster mechanisms, an equivalent model-based approach is proposed to the spatial scan statistic that unifies currently loosely coupled methods for including ecological covariates in the spatial scan test. In addition, the uti...
Journal: Statistics in Medicine
Volume 26, Issue 8
Pages: -
Publication date: 2007